The purpose of this project is to uncover and document valued actionable insights which are contained within the available source data for the benefit of the target audience.
The target audience includes coaches, personal trainers, athletes, ahletics governing body officials, interested members of the public, sports enthusiasts, sports statisticians and data scientists.
The objective is to explore Victorian interclub track & field athletic competition results data for the complete 2017-18 summer season and identify:
Important note: This analysis is a not-for-profit independent analysis conducted by Bree McLennan, using publically available data from the Athletics Victoria Website. This analysis does not represent the opinions of Athletics Victoria.
The data for interclub rounds 1 to 12 is contained in individual csv files, by round, for each participating Victorian region.
General description of the source data:
Technical approach to creating the analysis data:
## Observations: 28,799
## Variables: 61
## $ KEYRegistrationNumber <fct> 2559, 10002, 446, 459, ...
## $ NUMDistance <int> NA, NA, NA, NA, NA, NA,...
## $ NUMStandardDistance <int> NA, NA, NA, NA, NA, NA,...
## $ NUMPerformance <dbl> 11.53, 22.93, 4.16, 4.0...
## $ NUMWindReading <dbl> NA, NA, -2.1, -1.4, 2.6...
## $ NUMPointsAwarded <int> 153, NA, 307, 283, 222,...
## $ CATAthleteRegisteredZone <fct> Ballarat, Ballarat, Bal...
## $ CATCompetitionVenue <fct> Ballarat, Ballarat, Bal...
## $ CATGender <fct> Female, Female, Female,...
## $ CATAthleteRegisteredClub <fct> BYC, WEN, BHA, WEN, BHA...
## $ CATAgeGroup <fct> F14, F14, F14, F14, F14...
## $ CATAthleteDivision <fct> 0, 0, 0, 0, 0, 0, 0, 0,...
## $ CATAthleteTeamNumber <fct> 1, 0, 1, 1, 1, 1, 1, 1,...
## $ CATEventDiscipline <fct> Javelin, Javelin, Long ...
## $ CATDistance <fct> NA, NA, NA, NA, NA, NA,...
## $ CATStandardDistance <fct> NA, NA, NA, NA, NA, NA,...
## $ CATEventSpecification <chr> "400g", "400g", "", "",...
## $ CATEventHeat <fct> 1, 1, 1, 1, 1, 1, 1, 1,...
## $ CATEventNote <fct> 400g, 400g, INV, None, ...
## $ ORDCompetitionRound <fct> 1, 1, 1, 1, 1, 1, 1, 1,...
## $ TXTPerformanceFormatted <chr> "11.53m", "22.93m", "4....
## $ BINValidEventAttempt <fct> 1, 1, 1, 1, 1, 1, 1, 1,...
## $ BINInvitationEventOrAthlete <fct> 0, 1, 0, 0, 0, 0, 0, 0,...
## $ BINTeamEvent <fct> 0, 0, 0, 0, 0, 0, 0, 0,...
## $ CATEventFullName <fct> Javelin 400g, Javelin 4...
## $ CATEventGroupL1 <fct> Field, Field, Field, Fi...
## $ CATEventGroupL2 <fct> Throw, Throw, Jump, Jum...
## $ CATAgeGroupLeveL1 <fct> Juniors, Juniors, Junio...
## $ CATAgeGroupKey <fct> , , , , , , , , , , , ,...
## $ BINAthleteWithDisability <fct> 0, 0, 0, 0, 0, 0, 0, 0,...
## $ CATAthleteAWDClass <fct> NA, NA, NA, NA, NA, NA,...
## $ ORDEventFinishOrder <int> 2, 1, 1, 2, 3, 4, 5, 6,...
## $ ORDEventFinishOrderPoints <dbl> 1, 0, 1, 2, 3, 4, 5, 6,...
## $ NUMTotAthletesInVenueAgeEventPoints <dbl> 1, 1, 6, 6, 6, 6, 6, 6,...
## $ NUMTotAthletesInVenueAgeEvent <int> 2, 2, 7, 7, 7, 7, 7, 7,...
## $ ORDRoundEventFinishOrder <int> 11, 4, 10, 13, 21, 26, ...
## $ ORDRoundEventFinishOrderPoints <dbl> 11, 0, 11, 14, 21, 26, ...
## $ NUMTotAthletesInRoundAgeEventPoints <dbl> 13, 13, 32, 32, 32, 32,...
## $ NUMTotAthletesInRoundAgeEvent <int> 13, 13, 33, 33, 33, 33,...
## $ CATClubDistrict <fct> Country, Country, Count...
## $ CATClubZoneName <fct> Ballarat, Ballarat, Bal...
## $ CATAthleticClubName <fct> Ballarat YCW Harriers, ...
## $ CATAthleticTrackVenueName <fct> Llanberris Athletic Res...
## $ CATVenueTrackType <fct> Synthetic, Synthetic, S...
## $ CATVenueAddress <chr> "York St, Golden Point,...
## $ CATVenueDistrict <fct> Country, Country, Count...
## $ CATVenueMapCoord <chr> "-37.5693937,143.865968...
## $ NUMVenueLatitude <dbl> -37.56939, -37.56939, -...
## $ NUMVenueLongitude <dbl> 143.866, 143.866, 143.8...
## $ CATVenueZone <fct> Ballarat, Ballarat, Bal...
## $ CATVenueVictoriaRegion <fct> Ballarat, Ballarat, Bal...
## $ NUMAdjustFactorAWD <dbl> NA, NA, NA, NA, NA, NA,...
## $ NUMAdjustFactorVET <dbl> NA, NA, NA, NA, NA, NA,...
## $ BINAthleteCompeteAwayVenue <fct> 1, 1, 1, 1, 1, 1, 1, 1,...
## $ CATAthleteEventStatus <fct> OK, OK, OK, OK, OK, OK,...
## $ NUMPerformanceAWDAdjusted <dbl> 11.53, 22.93, 4.16, 4.0...
## $ ORDRoundEventFinishOrderAWDAdj <int> 11, 4, 10, 13, 21, 26, ...
## $ ORDRoundEventFinishOrderPointsAWDAdj <dbl> 11, 0, 11, 14, 21, 26, ...
## $ NUMEventFinishOrderPoints11 <dbl> 11, 0, 11, 9, 8, 7, 6, ...
## $ NUMRoundEventFinishOrderPoints11 <dbl> 1, 0, 1, 1, 1, 1, 1, 1,...
## $ NUMRoundEventFinishOrderPoints11AWDAdj <dbl> 1, 0, 1, 1, 1, 1, 1, 1,...
# Participation rates for each round
wrk.03DataTrans_Q1A <- wrk.03DataTrans_03 %>%
filter(KEYRegistrationNumber %ni% c("0")) %>% #remove teams
group_by(ORDCompetitionRound) %>%
summarise(NUMAthletesParticipating = n_distinct(KEYRegistrationNumber),
NUMTotalEventsParticipated = n())
barplot(rev(wrk.03DataTrans_Q1A$NUMAthletesParticipating),
main = "Athlete participation by round",
col = rgb(0.2,0.4,0.6,0.6), horiz = TRUE , las = 1 ,
xlab = "Number of athletes",
names.arg = rev(wrk.03DataTrans_Q1A$ORDCompetitionRound))# Participation by event
wrk.03DataTrans_Q2A <- wrk.03DataTrans_03 %>%
group_by(ORDCompetitionRound, CATEventFullName) %>%
summarise(NUMAthletesParticipating = n_distinct(KEYRegistrationNumber),
NUMTotalEventsParticipated = n())
datatable(wrk.03DataTrans_Q2A, class = "cell-border stripe",
caption = 'Participation by event type',
rowname = FALSE,
options = list(autoWidth = TRUE, searching = TRUE)
) TODO: add in lessons learned, traps for young players